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Many biomedical experiments are carried out by pooling individual biological samples. However, 
pooling samples can potentially hide biological variance and give false confidence concerning the 
data significance. In the context of microarray experiments for detecting differentially expressed 
genes, recent publications have addressed the problem of the efficiency of sample-pooling, and some 
approximate formulas were provided for the power and sample size calculations. It is desirable to 
have exact formulas for these calculations and have the approximate results checked against the 
exact ones. We show that the difference between the approximate and exact results can be large. 

In this study, we have characterized quantitatively the effect of pooling samples on the efficiency 
of microarray experiments for the detection of differential gene expression between two classes. 
We present exact formulas for calculating the power of microarray experimental designs involving 
sample pooling and technical replications. The formulas can be used to determine the total numbers 
of arrays and biological subjects required in an experiment to achieve the desired power at a given 
significance level. The conditions under which pooled design becomes preferable to non-pooled 
design can then be derived given the unit cost associated with a microarray and that with a biological 
subject. This paper thus serves to provide guidance on sample pooling and cost effectiveness. The 
formulation in this paper is outlined in the context of performing microarray comparative studies, 
but its applicability is not limited to microarray experiments. It is also applicable to a wide range 
of biomedical comparative studies where sample pooling may be involved. 

A Java Webstart application can be accessed at http://wads.le.ac.uk/htox/WadsSoftware 
/MrcStats/SCal4Poolings.jnlp 



I. INTRODUCTION 

Pooling samples in biomedical studies has now become 
a frequent practice among many researchers. For exam- 
ple, more than 15% of the data sets deposited in the 
Gene Expression Omnibus Database involve pooled RNA 
samples 1]. The practice of pooling biological samples 
though is not a new phenomenon, as it can be traced 
back at least to 1940s and has been used in differ- 
ent application areas Q , e.g. , for the detection of certain 
medical conditions and estimation of prevalence in a pop- 
ulation. In the context of detecting differential gene ex- 
pressions using microarrays, divergent views on the wis- 
dom of pooling samples can be found in the literature 
0, H, IE S B 13 ■ One of the arguments supporting the 
practice of pooling biological samples is that biological 
variation can be reduced by pooling RNA samples in mi- 
croarray experiments^. As more carefully described by 
Kendziorski et al [l|, pooling can reduce the effects of 
biological variation, but not the biological variation it- 
self. Another argument in support of pooling samples in 
microarray experiments is that it reduces financial cost. 
However, cost reduction is meaningful only if statistical 
equivalence between the pooled and the non-pooled ex- 
perimental setups is maintained. Here we address this 
issue and present formulas to determine the conditions 
under which pooled and non-pooled designs are statisti- 
cally equivalent. 

To compare experimental designs with and without 
sample pooling the two designs must have something 
in common that can be measured, e.g., using the same 
or equivalent amount of resources, or, yielding the same 
level of detection power. Kendziorski et a/[10j used the 



width of the 95% confidence interval for gene expression 
to compare different experimental designs with and with- 
out sample pooling. The criterion was that the narrower 
the confidence interval, the more accurate the results 
from the experimental design. In a comparative study 
where two groups of biological subjects are compared 
the common goal of the different experimental designs is 
to detect a change between the two groups with a given 
power at a given false positive rate, as adopted in |9j. We 
shall use the latter method to compare different designs. 
So in this work statistical equivalence means that the de- 
signs have the same statistical power at the same level 
of significance. Therefore the more appropriate experi- 
mental design will be the one which uses less resources 
to achieve this statistical equivalence. 

The basic assumption underlying sample pooling is bi- 
ological averaging; that the measure of interest taken on 
the pool of samples is equal to the average of the same 
measure taken on each of the individual samples which 
contributed to the pool. For example in the situation 
of a microarray experiment, if r individual samples con- 
tribute equally to a pool, and the concentrations of a 
gene's mRNA transcripts for the r samples are denoted 
by Tj with i = 1, 2, • • • , r indexing the individual samples, 
the assumption of biological averaging says that the con- 
centration of this gene's mRNA transcripts in the pool 
is T = 1 /r y^j— i Ti. However, for microarray experiment 
there is some debate on whether the basic assumption 
of pooling holds. Kendziorski et al 0, 0] argue that 
there is limited support for this assumption. Here we 
do not seek to enter into this debate but rather take the 
assumption of biological averaging as valid, or at least 
approximately so, so that we are in a position to deter- 
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mine whether pooling samples is financially beneficial or 
not. The validity of biological averaging makes it possible 
(or easier) to derive a neat theoretical formulation. On a 
practical level though, the requirement for the validity of 
this assumption may not be as stringent as a theoretical 
formulation does. For instance, in [j| it was shown that 
even when biological averaging does not hold, pooling 
can be useful and inferences regarding differential gene 
expression are not adversely affected by pooling. 

One situation where there is little alternative but 
to pool biological samples is where there is insufficient 
amount of RNA from each individual biological subject 
to perform single microarray hybridization. RNA ampli- 
fication may be a possible way of obtaining more RNA, 
but may not be practically feasible when many individ- 
ual biological subjects are involved as in the case of • 
In such a circumstance, pooling samples is justified by 
the lack of alternative and will not be considered further 
here. Similarly we will not consider here the case where 
all the biological samples of the same group were pooled 
together, and multiple technical replicate measurements 
were carried out on the sample pool. This is sometimes 
seen in the literature but such an experimental de- 
sign leaves no degree of freedom to estimate the biological 
variance. Thus valid inferences about the differences be- 
tween the two populations of biological subjects under 
study cannot be made. Here we only consider situations 
other than the above two and where pooling may reduce 
the overall costs of the experiments. 



II. A GENERAL FORMALISM 

For every comparative study, there is at least one mea- 
surable quantity which is the quantity of interest. The 
goal of the study is to deduce from the data collected 
if there is any difference between the means of the two 
populations. As measuring all the biological subjects in 
two populations is rarely possible in most situations rep- 
resentatives from a population are randomly selected and 
measurements made on these. These are then taken to 
infer the properties of the population. 

Let X be the measurable quantity that is being deter- 
mined in the the experiment, e.g., the expression level of 
a gene. In the case of one-channel microarray, X could 
denote the logarithm (most commonly base 2 is used) 
of fluorescence intensity; or the logarithm of the fluores- 
cence ratio in the case of two-channel microarray. Let 
x\ denote the value of X for an individual subject i in 
the control population (c) , and x* that of the individual 
subject j in the treatment population (t). We assume 
that x%s for all individuals in the control population are 
independent normally distributed with a mean /i c and a 
variance of, denoted by x\ ~ N(fx c ,a^) for all i. Simi- 
larly, arf ~ N(nt,(Tt) f° r au j- 



A. A general experimental setup 

For a general experimental setup individual subjects 
from both populations are randomly selected and tissue 
samples collected from each. Tissue sample pools are 
made by pooling a given number r of randomly selected 
tissue samples (of the same population) together. Note 
that to make n pools we need to have selected nr individ- 
ual subjects from the population, m measurements are 
then made on each pool of tissue samples. So m is the 
number of technical replications of measurement on each 
pool. Notice that by introducing two parameters r and 
m a general and flexible experimental setup has been cre- 
ated. For instance, if we set r = 1, the experiment would 
be equivalent to no pooling of tissue samples. And if 
we set m = 1 there is no technical replication. Under 
the basic assumption of biological averaging, the result 
of pooling r tissue samples in equal proportions together 
is that the value of X for the pool is the average of those 
subjects which formed this pool, 

1 r 

x = -J2^- (1) 

»=i 

It follows that x ~ N(fi c , of/r) for a pool from the con- 
trol population, or x, ~ iV(/Xi,of jr) for a pool from the 
treated population. Note that in this paper we shall 
only discuss pooling samples with equal individual con- 
tributions. While pools formed by un-equal contributions 
from individual samples are possible, such pooled exper- 
imental design is generally less effective than the equal 
pooling, as already shown by Peng et al @ with their 
simulated results. 

When we take a measurement on a pool p, the mea- 
sured value is 

V P ,k=x p + e kl (2) 

where p indexes pools, k indexes measurements, and e k 
is a random error term assumed to be independently and 
normally distributed as e k ~ A(0,of). Hereafter of will 
be referred to as the technical variance, of the biological 
variance for the control population, and of the biological 
variance for the treatment population. 

The output of the experiment are the measurements 
on the two sets of pools. For the control group, we 
have i/p k for p = 1, . . . , n c and k = 1, . . . , m. And for 
the treatment group, we have y* k for p — 1, . . . , n t and 
k = 1, . . . , m. Here n c and n t are the numbers of pools 
prepared for the control and treatment population re- 
spectively. Our task is to infer population properties 
from these measured data. In particular, we want to 
know whether there is any difference between the two 
populations means fi c and [i t - It can be shown that 

y c =— EE£* (3) 

c P =i k=i 
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is an unbiased estimator of /i c , with a variance 



and similarly, 
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Shih et o/'s statistic for pooled design. Note that m does 
not need to equal 1. Here by incorporating two additional 
parameters r and m, the statistic to can deal with situa- 
tions where there are pooled tissue samples and multiple 
technical replications. 
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is an unbiased estimator of \x t , with a variance 
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If we make an additional assumption that the variances 
for the two populations of biological subjects are the 
same, i.e., a 2 — of = a 2 , then the difference between 

Eqs.JSJ) and ©, D = Y — Y , is an unbiased estimator 
of /i = \x t — fi c with a variance 
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The factor (cr 2 /r + a 2 /m) in Eq.J7J) can be estimated 
without bias by 



-. n t / m 

— - — y -Y 



y P ,k — Y 



It is then clear that 



{y - y c ) - (» t - mc) 

Spyjl/n c + TJnt 



(8) 



(9) 



follows the Student's t distribution with n c + n t — 2 de- 
grees of freedom. In detecting a differential gene expres- 
sion, we want to test the null hypothesis /i c = n t against 
an alternative hypothesis \i c ^ fit- So our test statistic 
is 



to 



s p \Jl/n c + l/n t 



(10) 



and there are no unknowns in Ea. flOp . Note that to 
can be seen as a generalized two-samplc-t-tcst statis- 
tic, which reduces to the statistic of the traditional two- 
sample t test with equal variance when we set the pa- 
rameters r — 1 (no pooling of tissue samples) and m = 1 
(no technic al replication of measurements). In Ref. @, 
IShih et all arrived at two separate statistics, one for non- 
pooled design, the other for pooled design. The to defined 
by Eci. (|10|l is in more general form, setting r = 1 and 
to = 1 in Ea. (|10|l recovers Shih et aPs statistic for non- 
pooled design; while setting r > 1 and m = 1 recovers 



B. Criteria of significance 

As with any statistical test we need to specify a thresh- 
old p-value Pth to claim significant results in the test. 
When all the other parameters are given, setting P t h is 
equivalent to setting a threshold, say |£|, for the statis- 
tics to defined in Ea. (|10|l . With this threshold t- value, 
our criteria for claiming a significant test is as follows: If 
to > we declare that fit — Mc > 0; if to < — 1£|, it is 
claimed as pt~Pc < 0. So the rate at which false positive 
claims are made is 

r-\€\ r°° 
Pth = / {t )dt + / 

Pn c +n t — 2 (t )dt 

J-oo J\£\ 

= 2 / p nc+nt _ 2 (t )dto = 2T ne+nt _ 2 (-|£|),(ll) 

J — oo 

where pn c +nt—2(-) is the probability density function 
(PDF) of the Student's t distribution with 7i c + 77 t — 2 
degrees of freedom, and T„ c+ „ t _2(-) is the corresponding 
cumulative probability distribution function (CDF). It is 
therefore apparent that the threshold t- value |£| can be 
obtained by solving the equation 2T„ c _|_ nt _2(— = Pth 
with a given false positive rate Pth- 



III. POWER FUNCTION 

In [ilij we presented a power function for a new statisti- 
cal t test (hereafter referred to as " two-labelling t test" ) 
in the context of using two-color microarrays to detect 
differential gene expression. Following similar steps we 
can derive the power function for the generalized two- 
sample t test presented in this paper, which reads 



S = 



/ Pn c+nt -2(Y)$ 

Jo 



-mVY =+ \fA 



y/n c + n t -2 



dY, (12) 



where p„ c+ „ t _2(F) is the PDF for the x 2 distribution 
with 77 c + n t — 2 degrees of freedom, and $(.) is the 
CDF for the standard normal distribution. The rate S 
at which a true difference between /_t t and fi c can be suc- 
cessfully detected is a function of n c , 77 t , \h\/<jd, and 
|£|. With (7 73 given by the square root of Eq.Q, and |£| 
determined by solving Ea. fll|) at a given false positive 
rate Pth, S is, eventually, a function of Pth, 77 c , 77 t , and 
ImI/ot». 

A few points are worth noting here. 
1. The two-labelling t test presented in ^| was de- 
signed to deal with systematic labelling biases generated 
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during microarray experimentation. The t test presented 
in this paper, however, assumes no systematic data bi- 
ases. In the case of two-color microarrays this requires a 
common reference design. In such an experimental design 
the labelling biases cancel themselves out in the calcula- 
tion of the test statistic. 

2. In the biological variances of the two popu- 
lations under comparison do not have to be the same, 
that is, we did not assume a 2 = erf. For the t test in 
this paper, we have made an additional assumption that 
of = of. Relaxing this requirement was possible, as in 
the case of the traditional two-sample t test with unequal 
variance |14| . but an exact power function could not be 
readily obtained. 

3. The exact power function obtained in this paper 
allows evaluation of the effects of pooling biological sam- 
ples and the effects of taking multiple technical measure- 
ments, thus giving researchers quantitative guidance on 
the practice of pooling samples. 

4. By setting the parameters r — 1 and m — 1, an 
exact power function is provided for the traditional two- 
sample t test with equal variance. 

IV. RESULTS 

We have implemented the computation of the power 
function S of Ea. (|12|l as a Java application, which can 
be accessed at the URL given in the abstract. Here we 
apply this to microarray comparative studies for finding 
differentially expressed genes, and investigate the effect 
of pooling RNA samples in the experiments. We also 
compare our exact results with some approximate results 
presented by other authors to demonstrate why an 
exact formula is desirable. 



A. Comparison with approximate results 

Based on their approximate formulas, IShih et alJ con- 
sidered two scenarios to compare the number of bio- 
logical subjects and number of microarrays in the non- 
pooled and pooled designs 0- Here we give exact re- 
sults for the two scenarios to show the difference to 
the approximate results. In the first scenario, we con- 
sider that the common biological variance of the two 
populations is a 2 = 0.05, and the technical variance 
of = 0.0125, which gives the biological-to-technical vari- 
ance ratio A = a 2 /of = 4. The preset target of the 
experiment in this scenario is that the false positive rate 
being controlled at P t h — 0.001 and the power being no 
less than S = 0.95 to detect a two-fold differential gene 
expression, which corresponds to fj, = 1 with base 2 log- 
arithm 0. In Table |H we present results for different 
pooling parameter r. It can be seen from the first panel 
of this Table that in order to hit the preset target, the 
non-pooled design (r = 1) requires at least 12 biologi- 
cal subjects divided evenly to the two populations, i.e., 
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FIG. 1: The power S as a function of the total number of 
pools n c +n t . The parameters used are for the second scenario 
a 2 = 0.2, of = 0.05, A = a 2 /a 2 = 4, P th = 0.001, /x = 1, and 
m — 1. The five solid curves correspond to different levels of 
pooling, from right to left, r = 1, r = 2, r = 4, r = 6, and 
r — 15 respectively. The dashed line indicates the 95% power, 
the intersections of which with the power curves specify the 
total numbers of pools (assuming n c — n t ) needed to achieve 
the target power. The total number of biological subjects and 
the total number of arrays can then be calculated simply by 
N s = r(n c + n t ), and N m = m(n c + n t ) respectively. 



6 from each of the two populations. Having 7 subjects 
from one population and 5 subjects from the other is in- 
sufficient to achieve the target of 95% detection power. 
The effects of other levels of pooling on the detection 
power are also shown in Tabled The minimum number 
of biological subjects (N s ) and microarrays (N m ) that 
meet the preset targets are highlighted with bold fonts. 
It is clear that as the level of pooling is increased (with 
increasing r), the number of microarrays N m can be re- 
duced, but the number of biological subjects N s has to 
be increased. For example, in order to reduce the num- 
ber of arrays from 12 (Table [I] first panel) to 8 (Tabled 
fourth panel), the number of biological subjects to form 
the pools must be increased from 12 to 40. 

For the second scenario we consider the case a 2 = 0.2, 
a 2 — 0.05, which gives A = a 2 jo 2 — 4. Again the preset 
targets are to detect a true differential expression (i = 1 
with no less than 95% power while the false positive rate 
is set at Pth — 0.001. Using these parameters, the power 
S as a function of n c + n t is plotted in Fig^for different 
levels of sample-pooling. For the non-pooled design (r = 
1), N s = 30 total biological subjects and N m = 30 arrays 
are required to hit the preset targets. Similar to the first 
scenario, as the level of pooling is increased, the number 
of arrays N m is reduced while the number of subjects 
increased to meet the preset targets. 

In Table |Hj we summarize our exact results and the 
approximate results of • It can be seen that the differ- 
ence between the two can be very large, indicating the 
need for exact results. For example, in the first scenario 
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when N m = 8 the approximate result of predicts that 
a minimum of 21 biological subjects are required. In 
practice 24 subjects are required as 24 is the minimum 
number larger than 21 and divisible by 8. However this 
experiment setup (24 subjects forming 8 pools, 8 microar- 
rays) will only give a detection power of 90%. To meet 
the target power of 95%, 40 biological subjects are actu- 
ally required by our exact resul t. If an exp eriment with 
N m = 7 microarrays is planned, IShih et all predicts that 
37 subjects are required^, but in fact 126 subjects must 
be used to achieve the target. Generally, the approxi- 
mate formulas of are too optimistic in assessing the 
benefits of pooling samples and reducing the number of 
microarrays, because they underestimate the number of 
biological subjects required. 

B. Cost analysis 

Depending on the material costs involved in the bi- 
ological subjects and microarrays, the conditions where 
pooling samples becomes beneficial may be different from 
lab to lab. Here we show examples to determine these 
conditions. Denoting the cost associated with each bio- 
logical subject as C s (including materials and labor etc) 
and the cost associated with a microarray as C m , then 
the total costs for an experiment in microarray compar- 
ative study is Ct = N S C S + N m C m . Taking the first 
scenario as an example, the total cost of a non-pooled 
design to achieve our preset targets is 

C T (r = 1) = 12C S + 12C m , 

and the total cost for pooled design with r — 2 is 

C T (r = 2) = 20C S + 10C m . 

Therefore in order that the pooled design with r = 2 is 
beneficial we must have 

C T (r = 2) < C T (r = 1), (13) 

which requires that C m > 4C S . Put another way, only 
when the cost associated with one microarray C m is more 
than 4 times the cost of a subject C s , does the pooling 
design with r = 2 become preferable to the non-pooled 
design. Similarly a higher level of pooling with r = 3 
becomes preferable to r = 2 only when C m > 7C S . Fur- 
thermore the conditions for increasing the level of pooling 
from r = 3 to r = 5 are C m > 13C S , and so on. Table UTI 
gives these conditions for further levels of pooling. 

For the first scenario using the actual cost figures given 
in where C s = $230 and C m — $300, it can be seen 
that none of the pooling conditions is met. Therefore 
for this laboratory pooling samples is not recommended. 
However, if we use the cost figures of 01 where C s = $50 
and C m — $700, an optimal design is a pooled design with 
r = 5. 



For the second scenario, it is a similar story. The cost 
figures of Ref. @ (C s = $230 and C m = $300) gives 
C m = 1.30C S , which does not satisfy any of the pooling 
conditions. So again the non-pooled design with N m — 
30 and N s — 30 is recommended. On the other hand, 
the cost figures of [ijj (C s = $50 and C m = $700) give 
C m = 14C S which satisfies all the pooling conditions in 
the lower panel of Table [H] except the last row. So in 
Kendziorski et al's lab, the pooled design with N m = 14 
and N s — 84 would be recommended. 



V. DISCUSSION 

We have in this paper presented exact formulas for 
calculating the power of microarray experimental design 
with different levels of pooling. These formulas can be 
used to determine the conditions of statistical equivalence 
between different pooling setups. As in [ljj an d @> the 
calculations presented in this paper are for an individual 
gene, so the statistical equivalence for different designs 
of pooling can be determined with regard to one par- 
ticular gene. However, microarray monitors thousands 
of genes simultaneously, and the biological and techni- 
cal variances vary from gene to gene, therefore no single 
result of statistical equivalence between pooled and non- 
pooled designs applies equally to all genes on the array. 
So in practice how would the formulations in this work be 
used? One possible way, as suggested by Kendziorski et 
al. ^(ji i s to specify the distributions of a 2 and a t and 
calculate the total number of subjects and arrays that 
maximize the average power across the the array. In the- 
ory, if the biological variances and technical variances 
were known for all genes on the array, an equivalence 
condition between pooled and non-pooled designs could 
be determined for each gene individually. The overall 
(or say, average) equivalence condition between pooled 
and non-pooled designs could be obtained, for example, 
by some form of averaging operation over all genes. An 
alternative and probably more practical way is to use 
representative values of a 2 and a e . We therefore propose 
that parameters for "typical gene" be used as inputs for 
the power and sample size calculations. A typical gene is 
a gene whose biological and technical variance have the 
most probable values among the genes, i.e., the mode 
of the distribution for biological and technical variance 
of genes. Alternatively, the median or mean variances 
across genes could be used as representative values 0. 

An issue associated with microarray experiments is the 
problem of multiple inferences, where a separate null hy- 
pothesis is being tested for each gene. Given thousands 
of null hypotheses being tested simultaneously, the cus- 
tomary significance level a = 0.05 for declaring positive 
tests will surely give too many false positives. For ex- 
ample, if among a total number N = 10000 of genes 
being tested, No = 4000 are truly null genes (genes that 
are non-differentially expressed between the two classes), 
the expected number of false positive results would be 
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4000 x 0.05 = 200, which may be too many to be ac- 
ceptable. Thus a smaller threshold p-value for declar- 
ing differentially expressed genes should be used. Effec- 
tively controlling false positives in a multiple testing situ- 
ation such as microarray experiments is an area which has 
drawn much attention in recent years due to the wider ap- 
plication of microarray technology. As discussed in our 
previous work in |13j . generally speaking, all different 
multiple-testing adjustment methods eventually amount 
to effectively setting a threshold p-value, and then reject- 
ing all the null hypotheses with p-value below this thresh- 
old. The classical Bonferroni multiple-testing procedure, 
which controls family-wise error rate at a by setting the 
threshold Pth — ot/N, is generally regarded as being too 
conservative in the microarray context. The FDR (False 
Discovery Rate) idea, initially due to in dealing with 
the multiple testing problem, has now been widely ac- 
cepted as appropriate to the microarray situation. Re- 
cently, Efron [16j extended the FDR idea by defining fdr, 
a local version FDR (local false discovery rate). When 
planning microarray experiments in terms of power and 
sample size calculation, the FDR of |l5j is more appropri- 
ate and convenient to use. There are now in the literature 
a few slightly different variants of the definition of FDR 
[Tit IrR lisj , but in essence it is defined as the propor- 
tion of false positives among all positive tests declared. 
To provide an interface between FDR and the formula- 
tion in the previous sections, here we show that there is 
a simple correspondence between controlling FDR and 
specifying the traditional type I error rate and power. 



Suppose that there are a total number N of genes being 
monitored by microarray, so there will be N hypotheses 
being tested, one for each gene. Suppose that a fraction 
ttq of the N genes are true null genes, i.e., genes that 
are non-differentially expressed between the two classes. 
Given the type I error rate Pth, the expected number of 
false positive tests is Pi/jiWro; Given the power S, the 
expected number of non-null genes (truly differentially 
expressed genes) that are declared positive is SN(l—n ). 
So the FDR achieved by this setting is 

FDR PthNirp P th ir 

~ PthNlto + SN(1 - 7T ) ~ P th 7T + 5(1 - TTQ) ' 

(14) 

Here ttq is an important parameter in controlling FDR, 
for which several different methods of estimating this pa- 
rameter have been proposed 0, 0, 0] . Especially the 
method we presented in ^| is an accurate yet computa- 
tionally much simpler algorithm than the one proposed 
by Storey and Tibshrirani in With the interface 

Eq. (|14|) , FDR can be readily presented and incorporated 
into the calculations. 
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TABLE I: For the first scenario described in the text, the de- 
tection power of designs with different levels of pooling. The 
r — 1 panel represents that of non-pooled design. Other pa- 
rameter values are: a 2 = 0.05, a 2 = 0.0f25, A = a 2 /a 2 — A, 
and m — 1. N s = r(n c + nt) is the total number of biological 
subjects required, and N m — m(n c +n t ) is the total number of 
measurements (microarrays) needed, counting both the con- 
trol and the treatment populations. The preset targets are 
false positive rate being controlled at P t h = 0.001, to detect 
two- fold differential expression (/i = 1) with power no less 
than 0.95. The minimum number of biological subjects (N s ) 
and microarrays (N m ) that meet the preset targets are high- 
lighted with bold fonts. 
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TABLE II: Comparison of our exact results and the approxi- 
mate results of Ref . |!j ■ The upper panel of the table is for the 
first scenario, where a 2 — 0.05, a\ = 0.0125, A = a 2 /a 2 — 4. 
The lower panel is for the second scenario, where a 2 — 0.2, 
a 2 — 0.05, and A = a 2 /a 2 = 4. The targets of both scenarios 
are that the false positive rate P t h = 0.001 and the power 
no less than S = 0.95. The last column in each panel gives 
the cost conditions when pooling samples become beneficial 
relative to a lower level of pooling shown in this table. 
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